The application layer provides applica- 
tion programs that a robot can execute. 
Examples of application programs in- 
clude those needed to perform such pre- 
scribed maneuvers as avoiding obstacles 
while moving from a specified starting 
point to a specified goal point or turning 
a robot in place through a specified az- 
imuthal angle. Each robot is provided 
with application software representing its 
own unique set of commands. The soft- 
ware establishes a graphical user inter- 
face (GU I ) for exchanging command in- 
formation with external computing 
systems. Via the GUI and its supporting 
interface software, a user can select and 
assemble, from the aforementioned set, 
commands appropriate to the task at 
hand and send the commands to the 


robot for execution. System software that 
interacts with the R4SA software at all 
three levels establishes a synchronized 
control environment. 

The R4SA software features two 
modes of execution: before real time 
(BRT) and real time (RT). In the BRT 
mode, a text configuration file is read in 
(each robot hasitsown uniquefile) and 
then device-driver-layer, device-layer, 
and application -layer initialization func- 
tions are executed. If execution is suc- 
cessful, then the system jumps into the 
RT mode, in which the system is ready to 
receive and execute commands. 

One goal in developing the R4SA ar- 
chitecture was to provide one computer 
code for many robots. The unique exe- 
cutable code for each robot is built by 


use of a configuration feature file. The 
set of features for a given robot is se- 
lected from a feature database on the 
basis of the hardware and mechanical 
capabilitiesof that robot. Recompilation 
of code is straightforward: modifications 
can readily be performed in the field by 
use of simple laptop-computer develop- 
ment and debugging software tools. 

This work was done by H rand Aghazar- 
ian, Eric Baumgartner, and Michael Garrett 
of Caltech for NASA's Jet Propulsion L abora- 
tory. Further information is contained in a 
TSP (seepagel). 

The software used in this innovation is 
available for commercial licensing. Please 
contactKarina Edmonds of theC aliforni a In- 
stitute of Technology at (626) 395-2322. 
Refer to NPO-41796. 


mR 4SA for Controlling Robots 

NASA's Jet Propulsion Labordtory, Pasadena, California 


The R4SA GUI mentioned in the im- 
mediately preceding article is a user- 
friendly interface for controlling one or 
more robot(s). ThisGUI makes it possi- 
ble to perform meaningful real-time 
field experiments and research in robot- 
ics at an unmatched level of fidelity, 
within minutesof setup. It provides such 
powerful graphing modes as that of a 
digitizing oscilloscope that displaysup to 
250 variables at rates between 1 and 200 
Hz. ThisGUI can be configured asmul- 
tiple intuitive interfaces for acquisition 
of data, command, and control to en- 


able rapid testing of subsystems or an en- 
tire robot system while simultaneously 
performing analysis of data. 

The R4SA software establishes an intu- 
itive component-based design environ- 
ment that can be easily reconfigured for 
any robotic platform by creating or edit- 
ing setup configuration files. The R4SA 
GUI enables event-driven and condi- 
tional sequencing similar to those of 
Mars Exploration Rover (MER) opera- 
tions. It has been certified as part of the 
MER ground support equipment and, 
therefore, is allowed to be utilized in 


conjunction with MER flight hardware. 
The R4SA GU I could also be adapted to 
use in embedded computing systems, 
other than that of the MER, for com- 
manding and real-time analysis of data. 

T his work was done by H rand A ghazarian 
of Caltech for NASA's Jet Propulsion L abora- 
tory. Further information is contained in a 
TSP (seepagel). 

The software used in this innovation is 
available for commercial licensing. Please 
contactKarina EdmondsoftheCalifornia In- 
stitute of Technology at (626) 395-2322. 
Refer to NPO -41 797. 


@ Bio-Inspired Neural Model for Learning Dynamic Models 

This model could be a basis for fast speech- and image-recognition computers. 

NASA'sJet Propulsion Laboratory Pasadena, California 


A neural-network mathematical 
model that, relative to prior such mod- 
els, places greater emphasison some of 
the temporal aspects of real neural 
physical processes, has been proposed 
as a basis for massively parallel, distrib- 
uted algorithms that learn dynamic 
models of possibly complex external 
processes by means of learning rules 
that are local in space and time. The al- 
gorithms could be made to perform 
such functions as recognition and pre- 
diction of words in speech and of ob- 
jects depicted in video images. The ap- 


proach embodied in this model is said 
to be "hardware-friendly" in the follow- 
ing sense: The algorithms would be 
amenable to execution by special-pur- 
pose computers implemented as very- 
large-scale integrated (VLSI) circuits 
that would operate at relatively high 
speeds and low power demands. 

It is necessary to present a large 
amount of background information to 
give meaning to a brief summary of the 
present neural -network model: 

• A dynamic model to be learned by the 
present neural -network model is of a 


type denoted an internal model or pre- 
dictor. In simplest terms, an internal 
model is a set of equations that pre- 
dicts future measurements on the basis 
of past and current ones. Internal 
models have been used in controlling 
industrial plants and machines (in- 
cluding robots). 

• One of the conclusions drawn from 
Pavlov's famous experiments was the 
observation that reinforcers of learn- 
ing (basically, rewards and punish- 
ments) become progressively less effi- 
cient for causing adaptation of 
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behavior as their predictability grows 
during the course of learning. The dif- 
ference between the actual occurrence 
and the prediction of the reinforcer is 
denoted as the reinforcement predic- 
tion error signal. In what is known in 
the art as the temporal-difference 
model (TD model) of Pavlovian learn- 
ing, the reinforcement prediction 
error signal is used to learn a rein- 
forcement prediction signal. The de- 
sired reinforcement prediction signal 
is defined as a weighted sum of future 
reinforcement signals wherein rein- 
forcements are progressively dis- 
counted with increasing time into the 
future. The reinforcement prediction 
error signal progressively decreases 
during learning as the reinforcement 
prediction signal becomes more simi- 
lar to the desired reinforcement pre- 
diction signal. 

•Algorithms based on the TD model 
("TD algorithms") have been analyzed 
and shown to be implementations of a 
variant of dynamic programming. Ma- 
chine-learning studies have shown that 
TD algorithms are powerful means of 
solving reinforcement learning prob- 
lems that involve delayed reinforce- 


ment. Examples of such problems in- 
clude board games, which involve de- 
layed rewards (winning) or punish- 
ments (losing). 

•Mid-brain dopamine neurons are so 
named because they modulate the lev- 
els of activity of other neurons by 
means of the neurotransmitter chemi- 
cal dopamine. The cell bodies of 
dopamine neurons are located in the 
brain stem and their axons project to 
many brain areas. Activities of 
dopamine neurons have been found 
to be strikingly similar to the predic- 
tion error signals of the TD model. 

• Real neural signals include spikelike 
pulses, the times of occurrence of which 
are significant. The term "biological 
spike coding” denotes, essentially, tem- 
poral labeling of such pulses in real neu- 
ronsor in a neural-network model. 
Thisconcludesthe background infor- 
mation. 

The present neural-network model 
incorporates biological spike coding 
along with some basic principles of the 
learning by synapses in the cortex of 
the human brain. According to the 
learning rule of the model, synaptic 
weights are adapted when pre- and 


postsynaptic spikes occur within short 
time windows. I n simplified terms, for a 
given synapse and time window, the 
synaptic strength is increased in the 
long term if the presynaptic spike pre- 
cedes the postsynaptic spike or is de- 
creased in the long term if the presy- 
naptic spike follows the postsynaptic 
spike. This learning rule has been 
shown to minimize prediction errors, 
indicating that the neural network 
learns an optimal dynamic model of an 
external process. 

This work was done by Tuan Duong, Vu 
Duong, and Roland Suri of Caltech for 
NASA's]^ Propulsion Laboratory. 

In accordance with Public Law 96-517, 
the contractor has elected to retain title to this 
invention. Inquiries concerning rights for its 
commercial use should be addressed to: 

In nova ti veTechn ology A ssets Management 
JPL 

Mail Stop 202-233 
4800 Oak GroveDrive 
Pasadena, CA 91109-8099 
(818) 354-2240 
E-mail: iaoffice@jpl.nasa.gov 
Refer to N PO-41691, volume and number 
of this NASA Tech Briefs issue, and the 
page number. 


@E volutionary Computing Methods for Spectral Retrieval 

Solutions to the inverse problem of spectral retrieval are found in a computationally efficient 
process. 

NASA's Jet Propulsion Laboratory, Pasadena, California 


A methodology for processing spec- 
tral images to retrieve information on 
underlying physical, chemical, and/or 
biological phenomena isbased on evolu- 
tionary and related computational 
methods implemented in software. In a 
typical case, the solution (the informa- 
tion that one seeks to retrieve) consists 
of parameters of a mathematical model 
that represents one or more of the phe- 
nomena of interest. 

The methodology was developed for 
the initial purpose of retrieving the 
desired information from spectral 
image data acquired by remote-sens- 
ing instruments aimed at planets (in- 
cluding the Earth). Examplesof infor- 
mation desired in such applications 
include trace gas concentrations, tem- 
perature profiles, surface types, 
day/ night fractions, cloud/ aerosol 
fractions, seasons, and viewing angles. 
The methodology is also potentially 
useful for retrieving information on 


chemical and/ or biological hazards in 
terrestrial settings. 

In this methodology, one utilizes an it- 
erative process that minimizes a fitness 
function indicative of the degree of dis- 
similarity between observed and syn- 
thetic spectral and angular data. The 
evolutionarycomputing methodsthat lie 
at the heart of this process yield a popu- 
lation of solutions (sets of the desired pa- 
rameters) within an accuracy repre- 
sented by a fitness-function value 
specified by the user. The evolutionary 
computing methods (ECM) used in this 
methodology are Genetic Algorithms 
and Simulated Annealing, both of which 
are well-established optimization tech- 
niques and have also been described in 
previous NASA Tech Briefs articles. These 
are embedded in a conceptual frame- 
work, represented in the architecture of 
the implementing software, that enables 
automatic retrieval of spectral and angu- 
lar data and analysisof the retrieved solu- 


tions for uniqueness. This framework is 
composed of three modules (see figure) : 

1. Thecentral core, which consistsof the 
aforementioned ECM ; 

2. The synthetic-spectra generator, 
which, coupled with the ECM, gener- 
ates a population of automatically re- 
trieved spectral solutions; and 

3. The synthetic-spectra degeneracy ana- 
lyzer ("degeneracy" is used here in 
the mathematical sense of signifying 
the existence of multiple equally valid 
solutions for a given set of data and 
user-defined accuracy), which applies 
several well-established mathematical 
methods to characterize the unique- 
ness (or the degeneracy, which is es- 
sentially the lack of uniqueness) of 
the solutions within the population. 
One advantage afforded by this ECM- 

based methodology over traditional spec- 
tral retrieval methodsisthe abilityto per- 
form an automatic, unbiased search for 
all solutions within the entire parameter 
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